CDS

Accession Number TCMCG075C24106
gbkey CDS
Protein Id XP_007018993.2
Location complement(join(6144603..6144815,6145594..6145921,6147168..6147628))
Gene LOC18592290
GeneID 18592290
Organism Theobroma cacao

Protein

Length 333aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007018931.2
Definition PREDICTED: flavonol synthase/flavanone 3-hydroxylase [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category Q
Description Belongs to the iron ascorbate-dependent oxidoreductase family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R02160        [VIEW IN KEGG]
R03126        [VIEW IN KEGG]
R06539        [VIEW IN KEGG]
R08082        [VIEW IN KEGG]
KEGG_rclass RC00657        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K05278        [VIEW IN KEGG]
EC 1.14.11.23        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00941        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
map00941        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005737        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGAGGTGGAGAGAGTGCAAGGCATTGCTAATTTTTCCACAGAAACAATCCCGGAAGAGTTCATTCGGTCGACGAATGAGCAGCCAGGGCTCACAACGGTTCAGGGGACCGTGCTGGAGGTCCCGGTCATCGATCTCAGCGACCCTGATGAAAAAAAGATGCTCGAAGCCATTATAGACGCGAGCCGTAATTGGGGGATATTCCAGGTTGTGAATCATGGCATCCCTGATGAGGTCATAAGGAAGCTGCAAGAGGCTGGCAAGGTTTTCTTTGAGCTCCCGCAAGAGGAAAAGGAGCTTTACGCCAAGCCTCCTGGATCTCAAAGCATCGAAGGGTATGGAACTAAGCTTCAAAAAGAACTGCAAGGGAAGAAAGCTTGGGTTGATCATCTATTCCACAAGATATGGCCTCCTCGTGAAATTAACTATCAGTTTTGGCCTAAAAATCCACCTTCTTACAGAGAGGCAAACGAAGAATACACAAAGCACATGCATGGGGTGGTAGACAAACTGTTTAGATGCCTTTCAGTAGGCTTAGGGCTGGAAGGGCATGAGTTGAAGGAGGCCGTGGGCGGTGAGAACTTGGTCTACCTTCTCAAGATCAACTACTATCCGCCGTGCCCACGGCCCGACCTGGCTCTCGGAGTGCCGTCTCACACCGACATGTCCTCCCTCACCATACTGGTTCCCAATGACGTGCAGGGCCTCCAAGCTAATAGAGATGGCCATTGGTATGATGTCAAATACATCCCTAACGCCCTCGTTATTCACATTGGTGATCAAGTCGAGATTGCGAGCAATGGCATGTATAGGAGTGTACTTCACAGAACCACAGTGAACAAAGAGCAAACAAGGATTTCATGGCCAGTGTTCTTGGAGCCACCATCAGACTTAGAAGTAGGACCTCACCCTAAGCTCGTTAATGAAGCAAATCCACCCAAATACAAAACCAAAAAGTACCGTGAATACTGTTACTGCAAGCTCAATAAGATTCCCCAGTAA
Protein:  
MEVERVQGIANFSTETIPEEFIRSTNEQPGLTTVQGTVLEVPVIDLSDPDEKKMLEAIIDASRNWGIFQVVNHGIPDEVIRKLQEAGKVFFELPQEEKELYAKPPGSQSIEGYGTKLQKELQGKKAWVDHLFHKIWPPREINYQFWPKNPPSYREANEEYTKHMHGVVDKLFRCLSVGLGLEGHELKEAVGGENLVYLLKINYYPPCPRPDLALGVPSHTDMSSLTILVPNDVQGLQANRDGHWYDVKYIPNALVIHIGDQVEIASNGMYRSVLHRTTVNKEQTRISWPVFLEPPSDLEVGPHPKLVNEANPPKYKTKKYREYCYCKLNKIPQ